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[57] ABSTRACT 

Automated capture of a string of intended alphabetic or 
alphanumeric characters is provided by using indications of 
telephone keys respectively selected in accordance with the 
intended characters and also using signals respectively rep- 
resenting utterances of each of the intended characters to 
generate selected characters. The telephone key indication is 
a dual tone multifrequency signal or an utterance represent- 
ing a number corresponding to the telephone key. A selected 
character corresponding to one of the telephone key indica- 
tions is generated based on the utterance signal correspond- 
ing to the one telephone key indication. 

10 Claims, 2 Drawing Sheets 
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CAPTURE OF ALPHABETIC OR processing system, is assumed to have decided that he or she 

ALPHANUMERIC CHARACTER STRINGS IN wishes to enter their name or other alphabetic information to 

AN AUTOMATED CALL PROCESSING the system, for a purpose such as placing an order or 

ENVIRONMENT receiving information. In this scenario, the user has available 

5 only a conventional telephone set, i.e., any telephone set 

BACKGROUND OF THE INVENTION unable to directly transmit alphabetic information across the 

The present invention relates to automated call telephone network, and communicates via this telephone set 

processing, and, more particularly, is directed to capturing W1 ™ the svstem> 

alphabetic or alphanumeric characters in an automated call Referring now to the drawings, and in particular to FIG. 

processing environment. 10 1» mere ^ illustrated a system 900 in which the present 

Automated call processing has achieved widespread invention is applied. As mentioned a user is assumed to 

usage. Applications include call routing, voice mail, direc- have access to ^ a conventional telephone set 910 which 

tory assistance, order processing, information dissemination communicates with the system 900 using conventional te e- 

and so forth communications facilities such as wired or wireless tele- 

. . , , , , . t . , 1 5 communications systems known to one of ordinary skill in 

However, existing telephone based services in which a ^ art 

caller is interacting with a computer do not capture alpha- „ * t nnn . . A . . 4 f 

.... t r . , . . . j - r r u The system 900 comprises communications interface 

betic character strings with a high degree of accuracy when /p nU u ixrrrrx mn u ** ai /enenpu 

• i ** u*u (COMM INTFC) 920, speech generation module (SPEECH 

the strings comprise letters which are selected from an ;,., xA n . n ' v . . & , . ,™ c ™„ nTJ ™^\ 

unlimited or very large domain, such as names. Since the set 20 9^0 , 930 ' ^V^top ™ d J ul ? <™ CH RECOG) 

of character strings cannot be defined in advance, the string 940 |. stor 0 a f n e ,nterface £ T0RAGE ^ fi T ft FC) / 50 ' St0rage 

must be speUed at it is captured. medium 960 memory 970, processor 980 and commumca- 

. , . Uons links therebetween. 

Automatically capturing alphabetic spelled character „ ... . . - n * A . , . , , . „ 

, . . , . . r t_i .1 Communications interface 920 is adapted to receive calls 

strings using only voice input is not feasible presently - , , , , Mn t i *u • j u 

L i . • . i - iL -i Li from a user telephone set 910, to supply synthesized speech 

because letter recognmon accuracy a too ow wim available is from fa ' ration modu]e 930 

to the telephone set 910, 

voice recognition ^ technology For example, it is difficult to tQ * frQm ^ ^ ^ ^ {q 

automatically distinguish B from P . recognition module 940, and to exchange information with 

Methods of automatically capturing alphabetic spelled processor 980. The system shown in FIG. 1 includes a 

character strings using only dual tone multifrequency communications bus and separate communications lines for 

(DTMF) input from a twelve-key keypad on a telephone set 30 carrying vo i C eband signals between the communications 

are cumbersome, as each telephone key does not uniquely interface 920 and each of speech generation module 930 and 

map to a single alphabetic character. Consequently, multiple speech recognition module 940, but one of ordinary skill in 

inputs per letter are required for disambiguation, e.g., to lhe art wiU ap p rec i ate that other configurations are also 

indicate "K" press "5" twice or press "5", "2". These suitable 

method are also error-prone due to the problem 35 ^ ^ mocMe 93Q ig ad d tQ feceive 

accidentally pressing the wrong key or multiple keys and control commands from ^ 980> to te a voi . 

being unaware of the error, the so-called fat finger effect. ceband sigQa] m resp0Qse theretQj and tQ deHver ^ gener . 

SUMMARY OF THE INVENTION atecl signal to communications interface 920. Preferably, 

dn speech generation module 930 generates synthesized speech 

Automated capture of a string of intended characters, at in a f requency band of approximately 300-3,300 Hz. In 

least one of which is alphabetic, is provided in accordance embodiments, speech generation module 930 may also 

with the principles of this invention by using indications of function to transmit ("play") prestored phrases in response 

telephone keys respectively selected in accordance with the {Q commands fr om processor 980; module 930 includes 

intended characters and also using signals respectively rep- appropriate signal storage facilities in these cases, 

resenting utterances of each of the intended characters to Speech recognition modu i e 940 is adapted (i) to receive 

generate selected characters. from ^^^^ interface 920 a voiceband signal 

In exemplary embodiments, the telephone key indication which can be a speech signal or a DTMF signal generated in 

is a dual tone multifrequency signal or an utterance repre- response to depression of a key on the telephone set 910, (ii) 

senting a number corresponding to the telephone key. A 5q to p r0C ess this signal as described in detail below and in 

selected character corresponding to one of the telephone key response to commands from processor 980, and (iii) to 

indications is generated based on the utterance signal cor- deliver the results of its processing to processor 980. As will 

responding to the one telephone key indication. be appreciated, in some embodiments speech recognition 

BRIEF DESCRIPTION OF THE DRAWINGS modu , le 9 f ^ludes storage for holding predetermined 

5S signals and/or for holding speech signals from telephone set 

FIG. 1 is a block diagram illustrating a configuration in 910 for the duration of a call, 

which the present invention is applied; and Storage interface 950 is adapted to deliver information to 

FIG. 2 is a flowchart of a method of automatically and retrieve information from storage medium 960 in accor- 

capturing an intended alphabetic character string. dance with commands from processor 980. The storage 

^™ iir r ™ x^™™„™™ ^ ™^ 60 medium 960 may be any appropriate medium, such as 

DETAILED DESCRIPTION OF THE tic disk ^ £ ^ nsisl0l 

PREFERRED EMBODIMENTS 9?0 ^ ay be ^vLtd by ming> for ^ xample> 

The present invention is related to the invention of U.S. ROM and RAM, and is adapted to store information used by 

patent application Ser. No. 08/581,716, filed Dec. 29, 1995, processor 980. 

the disclosure of which is hereby incorporated by reference. 65 Processor 980 is adapted to execute programs for inter- 
In an automated call processing scenario, for example, a acting with the user of telephone set 910 in accordance with 
caller, also referred to herein as a user of the automated call a control program typically stored on storage medium 960 
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and also loaded into memory 970. Processor 980 may also control and processing program executed by processor 980 

communicate with other systems via communications links will be apparent to one of ordinary skill in the art in view of 

(not shown), for example, to retrieve user-specific informa- FIG. 2. 

tion from a remote database and/or to deliver information ^ advantage 0 f this me thod is that the entire character 

captured from the user of telephone set 910 to a remote 5 string is processed together, so information in the string (i.e., 

database. its grammar) is available for use in disambiguation. 

In a typical call processing operation, the user employs . , , t c . . . , . , r * * 

telephone set 910 to place a call to system 900. Communi- Another advantage of this method is that it seems faster 

cations interface 920 receives the call and notifies processor the caller than character-by-character capture, since some of 

980 of an in-coming call event. Processor 980, in accordance Q the interaction overhead is amortized over the entire string, 

with its control program, instructs speech generation module rather than being incurred for each character. 

930 to generate a speech signal. Speech generation module This method also permits voice recognition technology 

930 generates the requested speech signal and delivers the which has imperfect letter recognition accuracy to be uti- 

generated signal to communication interface 920, which Kzed to provide highly accurate capture of an alphabetic 

forwards it to the telephone set 910. character string. 

In response to the generated speech signal, the user enters 3 At step 610 of mG 2 , the system prompts the user to 
information to system 900 via telephone set 910. As depfess a of telephone keys corresponding to a 
described in detail below, the information can be a speech string of a i pha betic characters to be input to the system, and 
signal or a DTMF signal generated in response to depression at gtep m the system rece ives the DTMF string corre- 
of a key on the telephone set 910. ^ sponding t0 me sequence of depressed telephone keys. For 
Communications interface 920 (a) receives the user- example, if the character string "GRAND" is to be input, the 
generated signal; (b) notifies processor 980 that a signal has user depresses the telephone keys "4", "7", "2", "6", "8"; in 
been received; and (c) delivers the signal to speech recog- tn j s examp le, the last key depression is confused, 
nition module 940. The module 940 processes the signal in specifically, "8" was depressed instead of "3". The sequence 
accordance with the present invention, as described in detail 25 may De explicitly terminated, such as by entry of the pound 
below, and delivers the result of its processing to processor ^ key> or implicitly terminated, such as by passage of a 
980. Based on this result, processor 980 proceeds through its t j me interval greater than a predetermined amount during 
control program, generally instructing speech generation which no further keys are depressed, 
module 930 to request information from the user or to Amatively, the system prompts the user to utter the 
deliver information to the user, and receiving processed user 3Q numbers of ^ tel hone key such as "four", 
input from speech recognition module 940. , w>) u ^ « eight „ M mentioned above , pre s- 
Entry of alphabetic or alphanumeric information accord- ently available vo ice recognition technology has a high level 
ing to the present invention, from the user to the system, will of accuracy for recognition of single digits, as compared 
now be described. w | tn tne \q VC \ 0 f accuraC y f or recognition of spoken alpha- 
Referring now to FIG. 2, there is illustrated a flowchart for 35 betic characters, 
a method of automatically capturing an intended alphabetic M 63() ^ g ^ ^ DTMF Qr 
character string. As used herein and in the clauns number ^ and creates a ^ fof limited 
"intended refers to what a user has m mind, irrespective of domain ^lion. For the case of a string consisting of 
errors arising from user mistake (e.g., depression of an ^ mM defines ^ alowabu 
incorrect telephone key) and/or system mistake (e.g., incor- m characte(s as {G H t} {p R s} {A B c} {M N G }, {T, 
rectly identifying a received signal). U V} 

The character string capture method illustrated in FIG. 2 ' ' ,. , .„ . . ... . , ... 

.... ..f , . TYrx.c • i u„ One of ordinary skill in the art will appreciate that 

generally involves the caller entering DTMF signals by . ... ' , , . , \f. 

depressing telephone keys corresponding to the alphabetic creatl0a of th h e S r ™ r ma / be dela /f d ™ tt h the ^ stem 

characters of the string lo be captured, L then the caller 45 f ^ strm § for Potation to the caller, as 

utters each character of the string. Alternatively, the caller described below. 

may speak the number of the telephone key, The DTMF (or Forward grammar prediction, in which previously entered 

spoken number) input indicates to the system how many letters ^ used to P redlct subsequently entered letters, and 

characters are in the string, and for each character, a limited backward grammar prediction, in which subsequently 

set of possibilities for that character. For example, if the 50 entered letters are used to P redlct previously entered letters, 

telephone key for "2" is depressed (or the word "two" is may be used at this point to generate a set of probable 

spoken), the set of character possibilities consists of "A", strings. For example, according to forward grammar predic- 

"B" and "C", The system uses grammar rules and the uttered tion for the English language, a first character of "G" imparts 

characters to aid in disambiguating the characters of the a likelihood ordering of {R, S, P} to the next character. 

DTMF (or uttered number) string, generates a string of 55 Initial characters of "G", "R" impart a very high probability 

selected alphabetic characters and presents the generated to "A" as the third character, and very low probabilities to 

string to the caller " c " ^ anotDer example, according to backward 

Ifthegeneratedcharacterstringiscorrect,thenthesystem Prediction a fourth character of "N" imparts a 

has correctly captured the intended character string. If the likelihood ordering of {A, C, B} to the third character, 

generated character string is incorrect, then the system eo At step 640, the system prompts the user to spell the 

presents its next most probable character string to the user string, that is, to utter the alphabetic characters comprising 

and inquires whether this string is correct. When the sys- the string. For example, using speech synthesis techniques 

tern's first generated character string is incorrect, the known to one of ordinary skill in the art, or using a 

responses of the user are additional inputs to aid in disam- pre-recorded message, the system tells the user, "Please spell 

biguating the DTMF (or uttered number) character string. 65 vour name." 

The flowchart illustrated in FIG. 2 encompasses the At step 650, the system receives signals respectively 

actions of the elements shown in FIG. 1. For example, a indicative of the uttered alphabetic characters. 
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At step 660, the system generates a string of selected 710, at which it determines whether there are any likely 

alphabetic characters which best matches the string entered strings which it has not yet presented to the user. If no likely 

by the user. In some embodiments, in case of disagreement strings remain, then appropriate processing occurs. FIG. 2 

between the DTMF (or uttered number) string and the illustrates the case where appropriate processing comprises 

uttered alphabetic character string, preference is given to the 5 inviting the user to try entering the string again at step 720, 

DTMF string, while in other embodiments, preference is and then returning to step 610. 

given to the uttered alphabetic character string. if there are likely strings which have not yet been pre- 

In particular, the system uses the signal representing the sented to the user, then at step 730, the system selects the 

uttered characters to disambiguate the DTMF string. For next most likely string as the generated string, and returns to 

example, after the signal representing the character "gee" is io step 670 to present this string to the user, 

received, the system accesses a subset of stored signals a variation of the method described above with regard to 

representing spoken alphabetic characters. In particular, the FIG. 2 occurs when the string to be captured comprises 

subset comprises stored signals corresponding to only the alphanumeric characters, such as an account identifier. Here, 

characters, usually three, indicated by the DTMF input. a more complicated grammar is needed as there are many 

Thus, the DTMF input is used to restrict the number of 15 possible ways of uttering numeric strings, 

stored signals used during pattern matching, resulting in Let it be assumed that the string to be captured consists of 

faster and more accurate match processing. a fixed num ber of characters, with alphabetic characters in 

The system compares the received signal with the selected certain positions, numeric characters in other positions and 

subset of stored signals, selects the stored signal which best either alphabetic or numeric characters in the remaining 

matches the received signal, and finds the alphabetic 20 positions. For example, the string might be of the form: 

character, "G", corresponding to the best matching stored aannn* 
signal. This alphabetic character is used to select one of the 

allowable characters {G, H, I}, specifically, the first of the where "A" indicates an alphabetic character, "N" indicates 

allowable characters, a numeric character and "*" indicates either an alphabetic or 

If the best matching alphabetic character is not one of the 25 a numeric character, 

allowable characters, the system may look up an alternate As in FIG. 2, the user is assumed to enter the DTMF 

best matching alphabetic character, and use this alternate characters for the account identifier string and also to speak 

best matching character to select one of the allowable the characters of their account identifier string (steps 

characters. For example, a look up table might list alternates 600-650). Using the account identifier "AD345M", the user 

for "G" as "B" "P" "T" "V" "Z". 30 would depress the telephone keys "2", "3", "3", "4", "5", 

If not already applied/ forward and backward grammar "6", then say, for example, "AY", "DEE", "THREE FORTY 

prediction may be used to determine character likelihood. FIVE , "EM . 

Either the DTMF entries or the signal for the uttered ™e s y stem generates a string of selected alphanumeric 

alphabetic characters may be used in predicting the charac- „ characters in the following manner. For each alphabetic 

ters intended by the DTMF or uttered character inputs. 35 position, the alphabetic characters corresponding to the 

Typically, the result of the processing at step 670 is that tele P hone u ke ^ depression are selected. For each numeric 

one string will be found to be more likely than others. P oslt,on > the number corresponding to the telephone key 

However, in some cases multiple strings will be above a depression is selected For each position which is either 

predetermined threshold, or appear similarly likely. In the dn alphabetic or numenc, both the alphabetic characters and the 

above-described example, if the user depresses the "8" key, 40 D ™ be ' corresponding to the telephone key depression are 

corresponding to {«T>, «U", "V"}, but utters "DEE" as the f w?^ f^Tcf ' ,w E^S™ { ' ' h 

corresponding alphabetic character, the strings "GRANT" i D » b > b h t 4 M 5 > \ M > N ' °> °/* . 

and "GRAND" are both good candidates for the best match- u Next ' the recei J ed Sl f als corresponding to the spoken 

ing string. When preference is given to the DTMF string, d , characters are used to select one of the possibilities for each 

"GRANT" is selected as the most likely string, also referred 45 al P^beUc key depression. In this example the received 

to as the generated string of selected alphabetic characters. s P oken ' AY resuh f m * elect ? 0D ^ A as the first 

. , tU , . u .u *u * a character, and the received spoken signal' DEE "results in 

At step 670, the system tnqu.res whether the generated „ D „ K the ^ char ^ 

stnng corresponds to the mtended stnng. For example, the for ^ 

system asks the user, "I understood G as in girl, R as in red, so . . / V _ _ , t , t • . t , n ' or ;, 

/ . . vr ^ ™ . t i *o»t! sions are ignored tor the moment, that is, the numenc key 

A as in apple, N as m nature, T as in top. Is this correct? It , . & „ „ ., . . „ Unrn „ ta „ *„ fU ? 

.„ . . . j .t. . • j *.l u depressions fully identify the numenc characters. In this 

will be appreciated that associating a word with each m rji rci /„ the numeric characters 

character, where the first letter of the associated word is the ^^mpk, {3}, {4}, i 5 ^^ 1 ^^^^' 

" . 4 , j . j * . * For the characters which are either alphabetic or numeric, 

character, assists the user in correctly understanding the . , , * - 

' J & the system creates a grammar based on possible ways oi 

sys em query. 55 uttermg tnese cnarac t e rs. Here, when the last character is 

The user replies with, typically a "yes or "no answer, ^ ^ di three characters are kaown t0 be 

which can be processed by presently available voice recog- {3} {4} {5} me Mq of utteri the lasl four 

mtion technology with a relatively high level of accuracy. At characters include- 

step 680, toe system receives the user's reply. THREE FOUR FIVE SIX 

At step 690, the system uses the reply to determine 60 Pnni3 mtrrv sty 

whether the generated string is the intended string. If the i«Ktti ^iri 1 aiA 

generated string matches the intended string, then the system THREE FORTY FIVE SIX 

has correctly captured the string and goes on to step 700, THIRTY FOUR FIVE SIX 

completion of string capture. THIRTY FOUR FIFTY SIX 

If the generated string does not match the intended string, 65 THREE THOUSAND FOUR HUNDRED FIFTY SIX 

then the reply received from the user at step 680 comprises THREE THOUSAND FOUR HUNDRED AND FIFTY 

an input used for disambiguation. The system goes on to step SIX 
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THREE FOUR HUNDRED FIFTY SIX 

THREE FOUR HUNDRED AND FIFTY SIX 

THREE HUNDRED FORTY FIVE SIX 

THREE HUNDRED AND FORTY FIVE SIX 
When the last character is "M", the possible ways of uttering 
the last four characters are 

THREE FOUR FIVE EM 

THREE FORTY FIVE EM 

THIRTY FOUR FIVE EM 

THREE HUNDRED FORTY FIVE EM 

THREE HUNDRED AND FORTY FIVE EM 
The system then compares signals for the possibilities where 
the last character is alphabetic or numeric, determined as 
described above, with the received signal ("THREE FORTY 
FIVE", "EM") and finds the best matching last character 
(step 660), in this example, {M}. 

By allowing for the different ways of uttering the numeric 
string, recognition accuracy of the alphabetic and alphanu- 
meric character positions is maximized. 

It will be appreciated that the recognition work is in 
distinguishing the alphabetic and alphanumeric character 
positions. The numeric character positions are, in this 
example, assumed to be correctly entered. 

Although an illustrative embodiment of the present 
invention, and various modifications thereof, have been 
described in detail herein with reference to the accompany- 
ing drawings, it is to be understood that the invention is not 
limited to this precise embodiment and the described 
modifications, and that various changes and further modi- 
fications may be effected therein by one skilled in the art 
without departing from the scope or spirit of the invention as 
defined in the appended claims. 

What is claimed is: 

1. A method of capturing an intended string of characters, 
at least one of which is alphabetic, comprising the steps of: 

receiving indications of telephone keys respectively 
selected in accordance with the intended string of 
characters, wherein each of the telephone key indica- 
tions is an utterance corresponding to a telephone key; 

receiving utterance signals respectively representing 
caller utterances of each of the intended characters for 
disambiguating the received telephone key indications; 

generating a string of selected characters based on the 
telephone key indications and the utterance signals; and 

determining whether the generated string of characters 
corresponds to the intended string of characters; 
wherein when the generated string corresponds to the 
intended string then the intended string has been cor- 
rectly captured. 

2. The method of claim 1, wherein the telephone key 
utterance represents a number corresponding to the tele- 
phone key. 

3. A method of capturing an intended string of characters, 
at least one of which is alphabetic, comprising the steps of: 

receiving indications of telephone keys respectively 
selected in accordance with the intended string of 
characters; 

receiving utterance signals respectively representing 
caller utterances of each of the intended characters for 
disambiguating the received telephone key indications; 

generating selected characters based on the telephone key 
indications and the utterance signals, wherein the step 
of generating includes choosing each of the selected 
characters corresponding to each of the telephone key 
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indications based on utterance signals corresponding to 
previously received telephone key indications; and 
determining whether the generated string of characters 
corresponds to the intended string of characters; 
5 wherein when the generated string corresponds to the 
intended string then the intended string has been cor- 
rectly captured. 

4. A method of capturing an intended string of characters, 
at least one of which is alphabetic, comprising the steps of: 

30 receiving indications of telephone keys respectively 
selected in accordance with the intended characters for 
disambiguating the received telephone key indications; 
receiving utterance signals respectively representing 
caller utterances of each of the characters; 

15 generating selected characters based on the telephone key 
indications and the utterance signals, wherein the step 
of generating includes choosing each of the selected 
characters corresponding to each of the telephone key 
indications based on utterance signals corresponding to 

20 subsequently received telephone key indications; and 
determining whether the generated string of characters 
corresponds to the intended string of characters; 
wherein when the generated string corresponds to the 
intended string then the intended string has been cor- 

25 rectly captured. 

5. A method of capturing an intended string of characters, 
at least one of which is alphabetic, comprising the steps of: 

receiving indications of telephone keys respectively 
selected in accordance with the intended string of 

30 characters, wherein the string has a predetermined 
number of character positions and the character at each 
position is specified by a respective format identifying 
said character as one of alphabetic only, numeric only 
or alphanumeric; 

35 receiving utterance signals respectively representing 
caller utterances of each of the intended characters for 
disambiguating the received telephone key indications; 
generating selected characters based on the telephone key 

indications and the utterance signals; and 
determining whether the generated string of characters 
corresponds to the intended string of characters; 
wherein when the generated string corresponds to the 
intended string then the intended string has been cor- 

45 rectly captured. 

6. Apparatus for capturing an intended string of 
characters, at least one of which is alphabetic, comprising: 

means for receiving indications of telephone keys respec- 
tively selected in accordance with the intended 
50 characters, wherein the telephone key indication is an 
utterance corresponding to a telephone key; 

means for receiving utterance signals respectively repre- 
senting caller utterances of each of the intended char- 
acters for disambiguating the received telephone key 
55 indications; 

means for generating selected characters based on the 
telephone key indications and the utterance signals; and 

means for determining whether the generated string of 
characters corresponds to the intended string of char- 
60 acters; wherein when the generated string corresponds 
to the intended string then the intended string has been 
correctly captured. 

7. The apparatus of claim 6, wherein the telephone key 
utterance represents a number corresponding to the tele- 

65 phone key. 

8. Apparatus for capturing an intended string of 
characters, at least one of which is alphabetic, comprising: 
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means for receiving indications of telephone keys respec- 
tively selected in accordance with the intended char- 
acters; 

means for receiving utterance signals respectively repre- 
senting caller utterances of each of the intended char- 
acters for disambiguating the received telephone key 
indications; 

means for generating selected characters based on the 
telephone key indications and the utterance signals, 
wherein the means for generating includes means for 
choosing selected characters corresponding to the tele- 
phone key indications based on utterance signals cor- 
responding to previously received telephone key indi- 
cations; and 

means for determining whether the generated string of 
characters corresponds to the intended string of char- 
acters; wherein when the generated string corresponds 
to the intended string then the intended string has been 
correctly captured. 

9. Apparatus for capturing an intended string of 
characters, at least one of which is alphabetic, comprising: 

means for receiving indications of telephone keys respec- 
tively selected in accordance with the intended char- 
acters; 

means for receiving utterance signals respectively repre- 
senting caller utterances of each of the intended char- 
acters for disambiguating the received telephone key 
indications; and 

means for generating selected characters based on the 30 
telephone key indications and the utterance signals, 
wherein the means for generating includes means for 



20 



25 



choosing selected characters corresponding to the tele- 
phone key indications based on utterance signals cor- 
responding to subsequently received telephone key 
indications; and 

means for determining whether the generated string of 
characters corresponds to the intended string of char- 
acters; wherein when the generated string corresponds 
to the intended string then the intended string has been 
correctly captured. 

10. Apparatus for capturing a string of characters, at least 
one of which is alphabetic, comprising: 

means for receiving indications of telephone keys respec- 
tively selected in accordance with the intended char- 
acters wherein the string has a predetermined number 
of character positions and the character at each position 
is specified by a respective format identifying said 
character as one of alphabetic only, numeric only or 
alphanumeric; 

means for receiving utterance signals respectively repre- 
senting caller utterances of each of the intended char- 
acters; 

means for generating selected characters based on the 
telephone key indications and the utterance signals; and 

determining whether the generated string of characters 
corresponds to the intended string of characters; 
wherein when the generated string corresponds to the 
intended string then the intended string has been cor- 
rectly captured. 
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